Assessment of a polygenic hazard score for the onset of pre-clinical Alzheimer’s disease

Abstract Background With a growing number of loci associated with late-onset (sporadic) Alzheimer’s disease (AD), the polygenic contribution to AD is now well established. The development of polygenic risk score approaches have shown promising results for identifying individuals at higher risk of developing AD, thereby facilitating the development of preventative and therapeutic strategies. A polygenic hazard score (PHS) has been proposed to quantify age-specific genetic risk for AD. In this study, we assessed the predictive power and transferability of this PHS in an independent cohort, to support its clinical utility. Results Using genotype and imaging data from 780 individuals enrolled in the Australian Imaging, Biomarkers and Lifestyle (AIBL) study, we investigated associations between the PHS and several AD-related traits, including 1) cross-sectional Aβ-amyloid (Aβ) deposition, 2) longitudinal brain atrophy, 3) longitudinal cognitive decline, 4) age of onset. Except in the cognitive domain, we obtained results that were consistent with previously published findings. The PHS was associated with increased Aβ burden, faster regional brain atrophy and an earlier age of onset. Conclusion Overall, the results support the predictive power of a PHS, however, with only marginal improvement compared to apolipoprotein E alone. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08617-2.


Introduction
The ε4 allele of the apolipoprotein E (APOE) gene is the strongest genetic risk factor for late onset AD [1]. However, polygenic or oligogenic [2] contributions to AD are now widely acknowledged. In recent years, genome wide association studies (GWAS) and next-generation sequencing efforts have facilitated the identification of a number of disease-onset-associated single nucleotide polymorphisms (SNPs) with much smaller effect size [3,4]. Collectively, these genetic variations could make a significant contribution to age-at-onset AD risk. The combined effect of these inherited genetic variations can be quantified and expressed as a polygenic risk score (PRS). The development and validation of a reliable PRS for AD could contribute to solving a major public health challenge by identifying the age-at-onset of individuals at high risk of developing the disease and therefore enable early screening and preventive therapies [5]. Typically, PRSs are constructed as the weighted sum of allele counts, where the weights correspond to the effects (β coefficients) of each SNP, extracted from a SNPs-disease association analysis (e.g. logistic regression for casecontrol studies) [6]. The development of PRSs for age-atonset of AD have demonstrated promising results with prediction capabilities showing over 80% accuracy in some cases [7,8].

Open Access
Data used in the preparation of this article was obtained from the Australian Imaging Biomarkers and Lifestyle study (AIBL). *Correspondence: michael.vacher@csiro.au 3 Collaborative Genomics and Translation Group, School of Medical and Health Sciences, Edith Cowan University, Joondalup 6027, Western Australia Full list of author information is available at the end of the article A polygenic hazard score (PHS) approach has also been developed for AD [9]. This PHS goes beyond AD risk prediction by providing estimates of individual age-specific risk for developing AD. Moreover, the PHS has been shown to be associated with Aβ-amyloid (Aβ) accumulation, accelerated cognitive decline and neurodegeneration in susceptible brain regions. In this study we aim to assess the useability and replicability of PHS in an independent cohort.

Sample population
The study used data from the Australian Imaging, Biomarkers and Lifestyle (AIBL) cohort, of which the design, enrolment process, neuropsychological assessments and diagnostic criteria have been previously described [10]. Of the 1572 participants enrolled in the AIBL study, we restricted the analysis to individuals having both genotype and imaging data available (N = 780). Longitudinal data was collected every 18 months over multiple years (mean = 4.8 years, SD = 2.1). Participants were classified as those with Mild Cognitive Impairment (MCI) [11] or AD [12] when the clinical criteria for diagnosis were met. In the absence of these features a classification of Cognitively Normal (CN) was given by a clinical review panel, blinded to Aβ-PET status (see below). Ethics approval for the AIBL study and all experimental protocols was provided by Austin Health, St Vincent's Health, Hollywood Private Hospital and Edith Cowan University. All experiments and methods were carried out in accordance with approved guidelines and regulations and all volunteers gave written informed consent before participating in the study.

MRI and PET imaging
All subjects underwent a 3 T MRI and Aβ-PET imaging. T1 MPRAGE MRI was obtained at 3 T using the Alzheimer's Disease Neuroimaging Initiative (ADNI) magnetization-prepared rapid gradient echo (MPRAGE) protocol, with in-plan resolution of 1 × 1 mm and 1.2 mm slice thickness. Freesurfer was used to estimate all cortical volumes from the T1 [13]. All volumes were corrected for age and ICV using a regression approach and reference population composed of healthy subjects (CN, Aβ negative, MMSE> 28, CDR = 0, APOE non-ε4). Left and right volumes were averaged.
Aβ-PET imaging was performed with one of five radiotracers: the Centiloid (CL) scale was used to provide a standard scale for Aβ-PΕΤ quantification [14], with 0 representing the typical Aβ-PΕΤ in young controls, and 100 the typical Aβ-PET in mild AD patients. Values equal to or above 20 CL were considered representative of abnormal levels of Aβ deposition [15]. CL were generated using CapAIBL software [14] and estimated in three regions of interest (whole neocortex, frontal and posterior cingulate). CL values were also projected onto the individual cortical surface and then transferred to a cortical atlas where statistical analyses were performed [16].

Cognitive scores
All AIBL participants complete a battery of neuropsychological tests as previously described [10]. The resulting data were used to calculate cognitive composite scores to assess recognition memory, executive function and episodic recall memory. Briefly, the composites were computed by standardising the outcome measure for each neuropsychological test to be included, using the baseline mean and standard deviation for the cognitively normal sample, then averaging those standardised scores. Each composite consists of the following tests; recognition memory (California Verbal Learning Test Second Edition and Rey Complex Figure Test), executive function (Controlled Oral Word Association Test and Category Switching), and episodic recall memory (California Verbal Learning Test Second Edition, Logical Memory II, and Rey Complex Figure Test) (Harrington reference). Clinical Dementia Rating (CDR) sum of boxes (CDRSB) score was also used to assess clinical progression.

Age of onset definition
The age of onset of abnormal levels of amyloid deposition was determined as follows: First a progression curve giving for Aβ-amyloid deposition as a function of disease progression time was constructed as described in [17,18]. Then, the participants' age of onset was estimated by using the progression curve to calculate the elapsed time between a participant passing the CL threshold and reaching their mean longitudinal Aβ-amyloid levels. Finally, this value was subtracted from their mean longitudinal age to get the age at onset.

Genetic data and Polygenic hazard score
Genome wide genetic data was ascertained from the OmniExpressHumanExome+ BeadChip (Illumina, USA) as previously described [10]. A polygenic hazard score was then derived from this genetic data for each individual following the methodology described previously [9]. Briefly, the approach consists in three steps. First, a list of 1854 SNPs implicated with AD (p-values < 10 − 5 ) was extracted from the published summary statistics (p-values and odds ratios) generated by the IGAP consortium [4]. Second, a forward stepwise regression to identify a subset of 31 SNPs, in addition to the two APOE variants, that were associated with AD age of onset. Finally, for each patient, a polygenic hazard score predicting the individual's risk of developing AD, given their polygenic profile and age was derived. In the analyses we assessed the PHS as a continuous measure, quantifying individual risk for AD, and as a dichotomous variable (high and low). To directly replicate the findings from Tan et al. [19], we used the same grouping method, defining high PHS by 1 standard deviation (SD) above the mean and low PHS by 1 SD below the mean (Supplementary Fig. 1).

Statistical analyses Association with cross-sectional Aβ deposition
We used linear regression to investigate the relationship between PHS and regional brain Aβ deposition at baseline. Three main areas were investigated: neocortical, frontal cortex and posterior cingulate. In this cross-sectional analysis, we controlled for age, gender, level of education in years and APOE ε4 status (0 = no ε4 allele, 1 = 1 or 2 ε4 alleles). To evaluate the contribution of PHS and APOE ε4 status in the linear regression, we used likelihood ratio tests to compare models with and without these terms.
To further assess the association between PHS and Aβ deposition, beyond the role APOE, we performed the same analysis in two sub-cohorts containing exclusively APOE ε4 carriers (N = 278; CN = 161, MCI = 58, AD = 59) and non-carriers (N = 502; HC = 412, MCI = 66, AD = 24). In these subsequent analyses, we used the same linear regression, however, we did not control for APOE ε4 status due to the lack of variation in these sub-populations. All the results were adjusted for multiple comparisons using false discovery rate (FDR). The same analysis was performed at a vertex level on a template cortical surface.

Association with regional brain atrophy
We used linear mixed-effects models to evaluate the relationship of PHS with longitudinal volume change in 33 regions of interest from the Desikan-Killiany atlas in Freesurfer [20]. As volumes were previously controlled for intra-cranial volume (ICV) and age using a healthy sub-population we only controlled for gender, level of education in years and APOE ε4 status (0 = no ε4 allele, 1 = 1 or 2 ε4 alleles). In these analyses, we also controlled for sex, education, and APOE status. We then examined the simple effects by comparing slopes of volume loss over time for individuals at high (+ 1 SD) and low (− 1 SD) levels of PHS [19,21].

Association with longitudinal cognitive decline
For comparative purposes, we used the same linear mixed effects models as described in the original Desikan paper [19]. The only deviation made from this model was the use of the CL value from the frontal cortex instead of the standard uptake volume ratio (SUVR). Therefore, the final linear mixed effect model was defined as follows: This model was used to investigate the association between PHS and cognitive decline and clinical progression rate (represented as Δ c ) across four measures: recognition, executive function, episodic recall and CDR-SB. In this model, Time represented the number of years since the baseline visit. The APOE term indicated the presence/absence of APOE ε4 allele, encoded as a binary variable (0 = no ε4 allele, 1 = 1 or 2 ε4 alleles) and the term (1| Patient) corresponded to the random intercept. Continuous variables were centred and scaled in all the analyses. Further, to assess the original model and limit potential over-specification, we used a stepwise variable selection approach (backward selection) and identified a reduced model based on superior model fit (Akaike information criterion).

Association with age of onset of abnormal levels of Aβ deposition
Cox proportional hazards models of survival were performed to compare the time taken to reach abnormal levels of neocortical Aβ between participants with low versus high PHS scores (threshold at 1.04), adjusted baseline age, gender and years of education. The definition for survival time was the number of years between birth and a) having a PET scan indicating abnormal levels of Aβ (classed here an event, age of onset), b) withdrawing from the study (censored), or c) the last completed follow-up examination without an event (censored). For some individuals it was necessary to impute the date at which their Aβ levels became abnormal as previously published [22]. The age at which 50% of the cohort (median age) reached abnormal levels of Aβ, was reported.

Demographics
The study was conducted on a cohort of 780 participants enrolled in the AIBL study [23] (422 females, 358 males). Population demographic information is Table 1

Association with cross-sectional Aβ deposition
Using linear regression to evaluate the relationship between PHS and Aβ burden at baseline, we found that increased PHS values were associated with higher Aβ deposition. Similar results were obtained when looking at CL values specifically in the three areas investigated, neocortical, frontal cortex and posterior cingulate, with stronger effects systematically obtained in the posterior cingulate region, followed by the frontal cortex and then

Association with regional brain atrophy
In the linear-mixed effect analysis, PHS was significantly associated with cortical volume changes in most regions of interest. The effects were strongest in regions from the AD cortical signature, including temporal lobe, entorhinal cortex, posterior cingulate and precuneus. The cortical atrophy was fastest in individuals with high PHS (Fig. 1). However, when looking at non-ε4 carriers only, none of the associations between PHS and cortical atrophy remained significant after controlling for multiple comparisons. In addition, the effect sizes were considerably reduced (Supplementary Fig. 2).

Association with cognitive decline
In the linear mixed model analyses, we did not find any significant association between PHS and cognitive decline. The analyses were restricted to non-AD individuals (CN + MCI and CN only) and the results were consistent across the three cognitive domains investigated (Table 3). Further, comparing individuals with high PHS against low PHS, there was no significant difference in rate of cognitive decline between the two groups. Likelihood ratio tests comparing linear-mixed effects models with and without the PHS term, showed that the presence of PHS in the model resulted in minor improvement  in CDR-SB (X 2 = 6.29, p = 0.043) and Episodic recall (X 2 = 6.58, p = 0.037). However, the addition of PHS did not result in a better fit in the model predicting Executive function (X 2 = 2.83, p = 0.24).

Association with age of onset of abnormal levels of Aβ deposition
The age of onset of individuals with a high PHS (> 1.04) was significantly lower than individuals with a low PHS (<− 0.67). The median age at which AIBL participants with a high PHS reached abnormal levels of Aβ (Fig. 2

Discussion
Over the last decade, GWAS have clearly demonstrated that common complex diseases are typically associated with hundreds or thousands of genetic markers, collectively contributing to disease risk. This highly polygenic underpinning has been utilised in PRS by aggregating the effect of multiple genetic variants into a single score to predict disease risk. PRS have shown predictive values in several complex disorders, however, the approach does not account for age of onset, which is critical for neurodegenerative disease such as AD. The recent development of polygenic hazard score overcomes this limitation by predicting individuals' age-specific risk of AD development [9]. In this study, we attempted to replicate the findings from Tan et al. [19] and assess the utility of a PHS in an independent cohort. While investigating the relationship between PHS and cross-sectional Aβ deposition, we found that the PHS had slightly better predicting capabilities than using APOE ε4 status alone. This result was expected considering the effects of the APOE ε2 and . The population was stratified by PHS, high (mean + 1 SD) versus low (mean -1 SD). Shaded areas indicate 95% confidence intervals ε4 alleles were already accounted for in the calculation of the PHS. However, the marginal improvement in models to predict Aβ deposition observed when using PHS, suggests that the number of APOE ε4 alone can provide comparable predictive information. The major contribution of APOE in the PHS definition is evident when looking at the stratified score distribution ( Supplementary  Fig. 1). Individuals falling in the low PHS bracket (1 SD below the mean) are all ε4 non-carriers, while all ε4/ε4 individuals, conversely, are found in the high PHS group (1 SD above the mean). Nonetheless, we were able to find associations between PHS and regional Aβ deposition and brain atrophy in non-ε4 individuals. Although with relatively small effect sizes, these associations are consistent with the findings reported by Tan et al. [19] and support the claim that the PHS can provide predicting capabilities beyond APOE status.
Investigating cognitive domains, we did not find any evidence of association between PHS and longitudinal cognitive decline. These analyses were performed on non-demented individuals (N = 697) with a proportion of 82% CN and 18% MCI. This ratio was very different from the study cohort utilised in Tan et al., which consisted of about 36% CN and 64% MCI. Considering that healthy individuals have a much less pronounced cognitive decline trajectory, the over-representation of CN in our dataset could have impacted the results. Further, the linear mixed model used in this study was slightly different than the one presented in the original paper. Specifically, we used the CL values instead of the frontal florbetapir SUVR. Although, it is not expected that this deviation would generate differing results, they could have impacted and decreased the strength of the associations between PHS and cognitive decline.
Investigating the associations between PHS and age of onset, we found that individuals with a high PHS had a significantly younger age of onset (67.6 years) than low PHS individuals (80 years). Among the APOE ε3/ε3 population, high PHS individuals had an expected age of onset approximatively 8 years younger than low PHS individuals. Although the difference was less noticeable in the ε3/ε3 population, this result showed that the polygenic information, beyond the APOE ε4 allele, was useful for predicting the age of disease onset.
Compared to traditional polygenic risk scores, which provide a lifetime risk of developing a disease, the polygenic hazard scores provide prediction on age-specific risk of disease development. This estimation of instantaneous risk for developing AD is valuable additional information, as it could improve monitoring disease progression and facilitate timely intervention. In this study, we showed that the PHS had utility in predicting abnormal Aβ deposition, brain atrophy and age of disease onset. However, the results suggested that APOE genotype alone could provide comparable predictive capabilities, suggesting that APOE remains the main component of the PHS. This disproportionate contribution is mainly due to the large weight attributed to APOE alleles, which shadows the less extensive effects of the other diseaseassociated variants utilised in the score calculation. This is the case in both PRS and PHS approaches and reflect the difficulty in classifying AD genetic risk above and beyond APOE ε4. Recent studies have suggested that the use of less stringent significance threshold for SNPs selection could results in better performing scores [7,8,24]. This strategy could potentially further improve the prediction accuracy of the PHS. However, caution must be taken as the inclusion of many SNPs could lead to over-specification and result in a score that performs poorly in other cohorts. Furthermore, in addition to nonmodifiable factors, PHS and PRS could both be enriched with the introduction of lifestyle and environmental factors, known to be associated with the disease. Moving forward, we can anticipate that combining genetic, lifestyle and environmental components will facilitate the development of more refined and personalised risk profiles that will become relevant in clinical settings.